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THE SCOPE OF THIS DOCUMENT is limited to a functional 
description of the Atari ST BLiTTER. This document is not a 
data sheet for system integration, rather it is a user 
manual for system programming. For more information, please 
refer to the texts listed at the end of this document. 


1 . Introduction 

The Atari ST Bit-Block Transfer Processor (BLiTTER) is 
a hardware implementation of the bit-block transfer (BitBlt 
aka blit) algorithm. BitBlt can be simply described as a 
procedure that moves bit-aligned data from a source location 
to a destination location through a given logic operation. 
The BitBlt primitive can be used to perform such operations 
as : 

o Area seed filling 

o Rotation by recursive subdivision 
o Slice and smear magnification 
o Brush line drawing using Bresenham DDA 
o Text transformations eg bold, italic, outline 
o Text scrolling 
o Window updating 
o Pattern filling 


And general memory-to-memory block copying [1] . 

The heart of BitBlt was first formally defined by New- 
man and Sproull in their description of the function 
RasterOp [2]. As defined, RasterOp performed its block 
transfers on a bit-by-bit basis and was limited to a small 
subset of possible source and destination Boolean combina- 
tions. Enhancements to RasterOp such as processing bits in 
parallel or introducing a halftone pattern into the transfer 
were literally left as exercises for the reader. 

In an effort to improve the functionality and perfor- 
mance of the original algorithm, the prescribed enhancements 
were incorporated into the definition of RasterOp and imple- 
mented in hardware as the RasterOp Chip [3] . However the 
RasterOp Chip lacked the two-dimensionality of the original 
function and suffered from a performance bottleneck caused 
by the loading and reloading of source, destination, and 
halftone data (ie it could not DMA). 

While efforts were being made to improve the perfor- 
mance of RasterOp, the formal definition of RasterOp was 
further refined and became the basis of the BitBlt copyLoop 
primitive in the Smalltalk-80 graphics kernel [4]. Because 
of its comprehensive interface definition, the BitBlt 
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primitive was inefficient and required special-case oPjimi 
rations that violated its general-purpose nature * .SSJflScS 
hardware solution was necessary to increase the P e **?™*£® e 
of the BitBlt copyLoop without sacrificing its functions-. 

ity. 

The Atari ST BLiTTER is a hardware solution to the per- 
formance problems of BitBlt. The BLiTTER is a DMA device 
that implements the full BitBlt copyLoop def ini ti^ with the 
addition of a few minor extensions. Single word or multi- 
word increments and decrements are provided for transfers to 
destinations in Atari ST video display memory. A center- 
mask, which would otherwise be a constant all ones, 
provided for an additional level of texture. The remainder 
of this document is J irectly based on the original func- 
tional description of the Atari ST BLiTTER. 
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2. Bit-Block Transfers 

As previously stated, a bit-block transfer can be 
described as a procedure that moves bit-aligned data from a 
source location to a destination location through a given 
logic operation. There are sixteen logic combination rules 
associated with the merging of source and destination data. 
Note that this set contains all possible combinations 
between source and destination. The following table con- 
tains the valid BitBlt combination rules: 


LOGIC OPERATIONS 


(~s&~d)|rs&d) I (s&~d) I ( s&d) 

MSB LSB 


0 

0 

0 

0 

0 

0 

0 

0 

1 

1 

1 

1 

1 

1 

1 

1 


0 

0 

0 

0 

1 

1 

1 

1 

0 

0 

0 

0 

1 

1 

1 

1 


0 

0 

1 

1 

0 

0 

1 

1 

0 

0 

1 

1 

0 

0 

1 

1 


0 

1 

0 

1 

0 

1 

0 

1 

0 

1 

0 

1 

0 

1 

0 

1 


OP 

COMBINATION RULE 

0 

all zeros 

1 

source AND destination 

2 

source AND NOT destination 

3 

source 

4 

NOT source AND destination 

5 

destination 

6 

source XOR destination 

7 

source OR destination 

8 

NOT source AND NOT destination 

9 

NOT source XOR destination 

A 

NOT destination 

B 

source OR NOT destination 

C 

NOT source 

D 

NOT source OR destination 

E 

NOT source OR NOT destination 

F 

all ones 


Adjustments to block extents and several other transfer 
parameters are determined prior to the invocation of the 
actual block transfer. These adjustments and parameters 
include clipping, skew, end masks, and overlap. 

Clipping. The source and destination block extents are 
adjusted to conform with a specified clipping rectangle. 
Since both source and destination blocks are of equal dimen- 
sion, the destination block extent is clipped to the 
of the source block (or vice versa). Note that the block 
transfer need not be performed if the resultant extent is 
zero. 


Skew. The source- to-destination horizontal bit skew is 
calculated. 
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End Masks. The left and right partial word masks are 
determined. The masks are merged if the destination is one 
word in width. 

Overlap. The block locations are checked for possible 
overlap in order to avoid the destruction of source data 
before it is transferred. 

In non-overlapping transfers the source block scanning 
direction is inconsequential and can by default be from 
upper left to lower right. In overlapping transfers the 
source scanning direction is also from upper left to lower 
right if the source-to-destination transfer direction is up 
and/or to the left (ie source address is greater than or 
equal to destination address). However, if the overlapping 
source-to-destination transfer direction is down and/or to 
the right (ie source address is less than destination 
address), then the source data is scanned from lower right 
to upper left. 

After the transfer parameters are determined the bit- 
block transfer operation can be invoked, transferring source 
to destination through the logic operation (HALFTONE and HOP 
will be described in the next section): 


IT-BLOCK TRANSFER 


HALFTONE 


SOURCE 

SOURCE 


DESTINATION 

| I<< SKEW | 

1 


HOP 


LOGIC OP 



ENDh 

1ASK 




NEW DESTINATION 
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3 . Functional Description 

Please refer to the bit-block transfer diagram in the 
previous section. To understand how the components of a 
block transfer work, let's look at the simplest possible 
transfer. Take the case where we wish to fill a block of 
memory with either all zeros or all ones (OP * 0 or OP = F) . 
In this case only the LOGIC OP block, which generates the 
ones or zeros, and the ENDMASK block are in the data path. 
If the end mask contains all ones, the BLiTTER will simply 
write one word after the other to the destination address 
without ever reading the destination. 

As the writes take place the destination address will 
be adjusted according to the values in the DESTINATION X 
INCREMENT, DESTINATION Y INCREMENT, X COUNT, and Y COUNT 
registers. These registers define the size and shape of the 
block to be transferred. The X and Y COUNT registers define 
the size of the block. The X COUNT register specifies the 
number of word-size writes required to update one line of 
the destination. The Y COUNT register specifies the number 
of these lines in the block. The DESTINATION X INCREMENT 
register is a signed (2's complement) 16-bit quantity which 
is added to the destination address to calculate the address 
of the next destination word of the line. On the last write 
of the line the DESTINATION Y INCREMENT is added to calcu- 
late the address of the first word of the next line. 

The end mask determines which bits of the destination 
word will be updated. Bits of the destination which 
correspond to ones in the end mask will be updated. Bits of 
the destination which correspond to zeros in the end mask 
will remain unchanged. Note that if any bits of the desti- 
nation are to be left unchanged, a read-modify-write is 
required. In order to improve performance a read will only 
be performed if it is required. There are three ENDMASK 
registers numbered 1 through 3. ENDMASK 1 is used only for 
the first write of the line. ENDMASK 3 is used only for the 
last write of the line. ENDMASK 2 is used for all other 
writes . 

Now let's consider a more complicated case, suppose we 
want to XOR a destination block with a 16x16 halftone pat- 
tern. First we load the HALFTONE RAM with the halftone pat- 
tern. Select halftone only using the HOP register and 
select source XOR destination using the OP register. The 
LINE NUMBER register is used to specify which of the 16 
words of HALFTONE RAM is used for the current line. This 
register will be incremented or decremented at the end of 
each line according to the sign of the DESTINATION Y INCRE- 
MENT register. Set the DESTINATION X and Y INCREMENT and X 
and Y COUNT registers to the appropriate values and start 
the transfer. This same procedure can be followed to do the 
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combination using any logic operation by simply changing the 
value in the OP register. Similarly the combination can be 
performed using a source block instead of the HALFTONE RAM 
or using the logical AND of a source block and the HALFTONE 
RAM by changing the value of the HOP register. A source 
block is the same size as the destination block but may have 
different increments and address defined by the SOURCE X and 
Y INCREMENT and SOURCE ADDRESS registers. 

Finally, let's look at the case when the source and 
destination blocks are not bit-aligned. In this case we may 
need to read the first two source words into the 32-bit 
source buffer and use the 16 bits that line up with the 
appropriate bits of the destination, as specified by the 
SKEW register. When the next source word is read, the lower 
16 bits of the source buffer is transferred to the upper 16 
bits and the lower is replaced by the new data. This pro- 
cess is reversed when the source is being read from the 
right to the left (SOURCE X INCREMENT negative). Since 
there are cases when it may be necessary for an extra source 
read to be performed at the beginning of each line to 
"prime" the source buffer and cases when it may not be 
necessary due to the choice of end mask, a bit has been pro- 
vided which forces the extra read. The FXSR (aka pre-fetch) 
bit in the SKEW register indicates, when set, that an extra 
source read should be performed at the beginning of each 
line to "prime" the source buffer. Similarly the NFSR (aka 
post-flush) bit, when set, will prevent the last source read 
of the line. This read may not be necessary with certain 
combinations of end masks and skews. If the read is 
suppressed, the lower to upper half buffer transfer still 
occurs. Also in this case, a read-modify-write cycle is 
performed on the destination for the last write of each line 
regardless of the value of the corresponding ENDMASK regis- 
ter. 
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4. Programming Model 

The BLITTER contains a set of registers that specify 
bit-block addresses, bit-block alignments, logic and 
halftone operations, and bus accesses. The register set-up 
time remains practically constant and is large relative to 
small block transfers, whereas large bit-blocks are dom- 
inated by the execution time of the transfer itself. 

4.1. Register Map 

The following is a map of the BLiTTER programmable 
registers (note that all unused bits read back as zeros): 


REGISTER MAP 

FF 8A00 
FF 8A02 
FF 8A04 

FF 8A1E 
FF 8A20 
FF 8A22 
FF 8A24 
FF 8A26 
FF 8A28 
FF 8A2A 
FF 8A2C 
FF 8A2E 
FF 8A30 
FF 8 A3 2 
FF 8A34 
FF 8 A3 6 
FF 8 A3 8 

FF 8A3A 
FF 8A3B 

FF 8A3C 


FF 8A3D 


oooooooo 

oooooooo 

oooooooo 

oooooooo 

oooooooo 

oooooooo 


oooooooo 

oooooooo 

oooooooo 

oooooooo 

oooooooo 

oooooooo 


oooooooo 

oooooooo 

oooooooo 


oooooooo 

oooooooo 

oooooooo 

( 

( 

oooooooo 1 

ooooooo- 

ooooooo- 

oooooooo 

ooooooo- 

oooooooo 

oooooooo 

oooooooo 

ooooooo- 

ooooooo- 

oooooooo 

ooooooo- 

oooooooo 

oooooooo 


HALFTONE RAM 


SOURCE X INCREMENT 
SOURCE Y INCREMENT 
SOURCE ADDRESS 

ENDMASK 1 
ENDMASK 2 
ENDMASK 3 

DESTINATION X INCREMENT 
DESTINATION Y INCREMENT 
DESTINATION ADDRESS 

X COUNT 
Y COUNT 


oo HOP 

oooo OP 

I 000-0000 I 

III LINE NUMBER 

| SMUDGE 

HOG 

BUSY 


loo — oooo 

|| SKEW 

NFSR 

FXSR 
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4.2. Bit-Block Addresses 

This subsection describes registers that specify bit- 

block origins, address increments, and extents. 

SOURCE ADDRESS ^ 

This 23-bit register contains the current address of 
the source field (only word addresses may be speci- 
fied). It may be accessed using either word or long- 
word instructions. The value read back is always the 
address of the next word to be used in a source opera- 
tion. It will be updated by the amounts specified in 
the SOURCE X INCREMENT and the SOURCE Y INCREMENT 
registers as the transfer progresses. 

SOURCE X INCREMENT . . _ _ . 

This is a signed 15-bit register, the least significant 
bit is ignored, specifying the offset in bytes to the 
address of the next source word in the current line. 
This value will be sign-extended and added to the 

SOURCE ADDRESS register at the end of a source word 
fetch, whenever the X COUNT register does not contain a 
value of one. If the X COUNT register is loaded with a 
value of one this register is not used. Byte instruc- 
tions can not be used to read or write this register. 

SOURCE Y INCREMENT , , „ 

This is a signed 15-bit register, the least significant 
bit is ignored, specifying the offset in bytes to the 
address of the first source word in the next line. 
This value will be sign-extended and added to the 

SOURCE ADDRESS register at the end of the last source 
word fetch of each line (when the X COUNT register con- 
tains a value of one). If the X COUNT register is 

loaded with a value of one this register is used 

exclusively. Byte instructions can not be used to read 
or write this register. 

DESTINATION ADDRESS 

This 23-bit register contains the current address of 
the destination field (only word addresses may be 
specified ) . It may be accessed using either word or 
long-word instructions. The value read back is always 
the address of the next word to be modified in the des- 
tination field. It will be updated by the amounts 
Specified in the DESTINATION X INCREMENT and the DESTI- 
NATION Y INCREMENT registers as the transfer 
progresses. 

DESTINATION X INCREMENT 

This is a signed 15-bit register, the least significant 
bit is ignored, specifying the offset in bytes to the 
address of the next destination word in the current 
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line. This value will be sign-extended and added to 
the DESTINATION ADDRESS register at the end of a desti- 
nation word write, whenever the X COUNT register does 
not contain a value of one. If the X COUNT register is 
loaded with a value of one this register is not used. 
Byte instructions can not be used to read or write this 
register. 

DESTINATION Y INCREMENT 

This is a signed 15-bit register, the least significant 
bit is ignored, specifying the offset in bytes to the 
address of the first destination word in the next line. 
This value will be sign-extended and added to the DES- 
TINATION ADDRESS register at the end of the last desti- 
nation word write of each line (when the X COUNT regis- 
ter contains a value of one). If the X COUNT register 
is loaded with a value of one this register is used 
exclusively. Byte instructions can not be used to read 
or write this register. 

X COUNT 

This 16-bit register specifies the number of words con- 
tained in one destination line. The minimum number is 
one and the maximum is 65536 designated by zero. Byte 
Instructions can not be used to read or write this 
register. Reading this register returns the number of 
destination words yet to be written in the current 
line, NOT necessarily the value initially written to 
the register. Each time a destination word is written 
the value will be decremented until it reaches zero, at 
which time it will be returned to its initial value. 

Y COUNT 

This 16-bit register specifies the number of lines in 
the destination field. The minimum number is one and 
the maximum is 65536 designated by zero. Byte instruc- 
tions can not be used to read or write this register. 
Reading this register returns the number of destination 
lines yet to be written, NOT necessarily the value ini- 
tially written to the register. Each time a destina- 
tion line is completed the value will be decremented 
until it reaches zero, at which time the tranfer is 
complete. 
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4.3. Bit-Block Alignments 

This subsection describes registers that specify bit- 
block end masks, source-to-destination skew, and source data 
fetching. 

ENDMASK 1, 2, 3 

These 16-bit registers are used to mask destination 
writes. Bits of the destination word which correspond 
to ones in the current ENDMASK register will be modi- 
fied. Bits of the destination word which correspond to 
zeros in the current ENDMASK register will remain 
unchanged. The current ENDMASK register is determined 
by position in the line. ENDMASK 1 is used only for 
the first write of a line. ENDMASK 3 is used only for 
the last write of a line. ENDMASK 2 is used in all 
other cases. In the case of a one word line ENDMASK 1 
is used. Byte instructions can not be used to read or 
write these registers. 


SKEW 

The least significant four bits of the byte-wide regis- 
ter at FF 8A3D specify the source skew. This is the 
amount the data in the source data latch is shifted 
right before being combined with the halftone mask and 
destination data. 


FXSR 

FXSR stands for Force extra Source Read. When this bit 
is set one extra source read is performed at the start 
of each line to Initialize the remainder portion source 
data latch. 


NFSR 

NFSR stands for No Final Source Read. When this bit is 
set the last source read of each line is not performed. 
Note that use of this and/or the FXSR bit the requires 
an adjustment to the SOURCE Y INCREMENT and SOURCE 
ADDRESS registers. 
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4.4. Logic Operations 

This subsection describes registers that specify the 
logic combinations of source and destination bit-block data. 

OP 

The least significant four bits of the byte-wide regis- 
ter at FF 8A3B specify the source/destination combina- 
tion rule according to the following table: 


LOGIC OPERATIONS 


OP 

COMBINATION RULE 

0 

all zeros 

1 

source AND destination 

2 

source AND NOT destination 

3 

source 

4 

NOT source AND destination 

5 

destination 

6 

source XOR destination 

7 

source OR destination 

8 

NOT source AND NOT destination 

9 

NOT source XOR destination 

A 

NOT destination 

B 

source OR NOT destination 

C 

NOT source 

D 

NOT source OR destination 

E 

NOT source OR NOT destination 

F 

all ones 
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4.5. Hal f tone Operations 

This subsection describes registers that specify the 
halftone pattern memory, halftone word index, and combina- 
tions of source and halftone data. 


HALFTONE RAM , _ . 

This RAM holds a 16x16 halftone pattern mask. Each 
word is valid for one line of the destination field and 
is repeated every 16 lines. The current word is 
pointed to by the value in the LINE NUMBER register. 
These registers may be read, but can not be accessed 
using byte-wide instructions. 


The least significant four bits of the byte-wide regis- 
ter at FF 8A3C specify the current halftone mask. The 
current value times two plus FF8A00 gives the address 
of the current halftone mask. This value is incre- 
mented or decremented at the end of each line and will 
wrap through zero. The sign of the DESTINATION Y 
INCREMENT determines if the line number is incremented 
or decremented (increment if positive, decrement if 
negative ) . 

SMUDGE . . ... . 

The SMUDGE bit, when set, causes the least significant 
four bits of the skewed source data to be used as the 
address of the current halftone pattern. Note that the 
halftone operation is still valid when SMUDGE is set. 


The least significant two bits of the byte-wide regis- 
ter at FF 8A3A specify the source /ha If tone combination 
rule according to the following table: 


HALFTONE OPERATIONS 


HOP 

COMBINATION RULE 

0 

all ones 

1 

halftone 

2 

source 

3 

source AND halftone 
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4.6. Bus Accesses 

This subsection describes registers that specify bus 
access control and BLiTTER start /status. 


The HOG bit, when cleared, causes the processor and the 
BLiTTER to share the bus equally, in this mode each 
will get 64 bus cycles while the other is halted. When 
set, the bit will cause the processor to be halted 
until the transfer is complete. In either case the 
BLiTTER will yield to other DMA devices. Bus arbitra- 
tion may allow the processor to execute one or more 
instructions even in hog mode. Therefore, don't assume 
that the instruction following the one which sets the 
BUSY bit will be executed only after the transfer is 
complete. The BUSY bit may be polled to achieve this 
kind of synchronization. 


BUSY , ^ 

The BUSY bit is set after all the other registers have 
been initialized to begin the transfer operation. It 
will remain set until the transfer is complete. The 
interrupt line is a duplicate of this bit. See the 
Programming Example for more details on how to use the 
BUSY bit. 
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Appendix A — Programming Example 


In order to maintain software compatibility with new or 
upgraded Atari STs equipped with the BLITTER, software 
developers need only follow guidelines set forth by the VDI 
and "LINE A" documents. Revised TOS ROMs will work in con- 
cert with the BLITTER, enhancing the performance of many VDI 
and "LINE A" operations. This occurs in a manner tran- 
sparent to an executing program. Thus no special actions 
need be taken to utilize the performance advantages of the 
BLiTTER. 

As a rule of thumb, never make a VDI or "LINE A" call 
from within an interrupt context since unpredictable and 
potentially catastrophic results will occur should one 
BLiTTER operation interrupt another BLiTTER operation. 

The following program has not been optimized and is 
presented here for exemplary purposes only. 


* (c) 1987 Atari Corporation 

* All Rights Reserved. 

* BLiTTER BASE ADDRESS 

BLiTTER equ $FF8A00 

* BLiTTER REGISTER OFFSETS 


Halftone 

equ 

0 

Src_Xinc 

equ 

32 

Src Yinc 

equ 

34 

Src_Addr 

equ 

36 

Endmaskl 

equ 

40 

Endmask2 

equ 

42 

Endmask3 

equ 

44 

Dst Xinc 

equ 

46 

Dst Yinc 

equ 

48 

Dst_Addr 

equ 

50 

X Count 

equ 

54 

Y Count 

equ 

56 

HOP 

equ 

58 

OP 

equ 

59 

Line Num 

equ 

60 

Skew 

equ 

61 
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* BLITTER REGISTER FLAGS 


f HOP Source 

egu 

1 

f HOP Halftone 

equ 

0 

fSkewFXSR 

equ 

7 

fSkewNFSR 

equ 

6 

fLlneBusy 

equ 

7 

fLineHog 

equ 

6 

fLineSmudge 

equ 

5 

* BLiTTER REGISTER MASKS 

mHOP Source 

equ 

$02 

mHOP_Hal f tone 

equ 

$01 

mSkewFXSR 

equ 

$80 

mSkewNFSR 

equ 

$40 

mLineBusy 

equ 

$80 

mLineHog 

equ 

$40 

mLineSmudge 

equ 

$20 


* EnDmAsKdAtA 

* 

* These tables are referenced by PC relative instructions. Thus, 

* the labels on these tables must remain within 128 bytes of the 

* referencing instructions forever. Amen. 

* 

* 0: Destination 1: Source <<< Invert right end mask data >>> 

If endmask: 

dc.w SFFFF 

rt_endmask : 

dc.w S7FFF 

dc.w S3FFF 

dc.w $1FFF 

dc.w SOFFF 

dc.w S07FF 

dc.w S03FF 

dc.w S01FF 

dc.w S00FF 

dc.w $007F 

dc.w $003F 

dc.w $001F 

dc.w $000F 

dc.w $0007 

dc.w $0003 

dc.w $0001 

dc . w $0000 
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* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 


TITLE: BLIT iT 


PuRPoSE: Transfer a rectangular block of pixels located at an 

arbitrary X # Y position in the source memory form to 
another arbitrary X,Y position in the destination memory 
form using replace mode (boolean operator 3). 

The source and destination rectangles should not overlap. 

iN: 

a4 pointer to 34 byte input parameter block 

NoTe: This routine must be executed in supervisor mode as access 

is made to hardware registers in the protected region of the 
memory map. 

InPuT pArAmEtEr BlOcK oFfSeTs 


SRC FORM 

egu 

0 ; Base address of source memory form 

.1 

SRC NXWD 

equ 

4 ; Offset between words in source plane 

. w 

SRC NXLN 

equ 

6 ; Source form width 

.w 

SRC NXPL 

equ 

8 ; Offset between source planes 

. w 

SRC XMIN 

equ 

10 ; Source blit rectangle minimum X 

. w 

SRC_YMIN 

equ 

12 ; Source blit rectangle minimum Y 

. w 

DST FORM 

equ 

14 ; Base address of destination memory form 

.1 

, r DST NXWD 

equ 

18 ; Offset between words in destination plane 

. w 

(jjjiST NXLN 

equ 

20 ; Destination form width 

. w 

^T)ST NXPL 

equ 

22 ; Offset between destination planes 

.w 

DST XMIN 

equ 

24 ; Destination blit rectangle minimum X 

. w 

DST_YMIN 

equ 

26 ; Destination blit rectangle minimum Y 

. V) 

WIDTH 

equ 

28 ; Width of blit rectangle 

. w 

HEIGHT 

equ 

30 ; Height of blit rectangle 

. w 

PLANES 

equ 

32 ; Number of planes to blit 

.w 

BLiT_iT: 




lea 

BLITTER 

, a5 ; a5-> BLiTTER register block 



* Calculate Xmax coordinates from Xmin coordinates and width 


move . w 

WIDTH (a4),d6 





subq . w 

#1, d6 

; d6<- width-1 



move . w 

SRC XMIN(a4),dO 

; d0<- src 

Xmin 



move . w 

do,ai 





add.w 

d6, dl 

; dl<- src 

Xmax = src 

Xmin 

+ width- 1 

move . w 

DST XMIN( a4 ) , d2 

; d2<- dst 

Xmin 



move . w 

d2,33 





add.w 

d6, d3 

; d3<- dst 

Xmax - dst 

Xmin 

+ width- 1 
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* 

* 

* 


Endmasks are derived from source Xmin nod 16 and source Xmax mod 16 


moveq . 1 #$0F , d6 


d6<- mod 16 mask 


move . w 
and. w 
add.w 
move . w 


d2,d4 ; d4<- DST_XMIN 

d6,d4 ; d4<- DSTXMIN mod 16 

44, <14 ; d4<- offset into left end mask tbl 

if ' endmask(pc, d4.w) ,d4 ; d4<- left endmask 


move . w d3,d5 » d5<- DST XMAX 

and. w d6,d5 ; d5<- DSTXMAX mod 16 

add.w d5,d5 ; d5<- offset into right end mask tbl 

move . w r tendmask ( pc , d5 . w ) , d5 ; d5<- inverted right end mask 
not.w d5 ; d5<- right end mask 


* 

* Skew value is (destination Xmin mod 16 - source Xmin mod 16) 

* && OxOOOF. Three discriminators are used to determine the 

* states of FXSR and NFSR flags: 

* 

* bit 0 0: Source Xmin mod 16 «< Destination Xmin mod 16 

* 1: Source Xmin mod 16 > Destination Xmin mod 16 

* 


* bit 1 

* 

* 


0: SrcXmax/16-SrcXmin/16 <> DstXmax/16-DstXmin/16 
Source span Destination span 

1: SrcXmax/16-SrcXmin/16 «* DstXmax/16-DstXmin/16 


* 


* bit 2 

* 

* 


0: multiple word Destination span 
1: single word Destination span 


* These flags form an offset into a skew flag table yielding 

* correct FXSR and NFSR flag states for the given source and 

* destination alignments 

* 


* 


move . w 

d2, d7 

and.w 

d6, d7 

and.w 

dO, d6 

sub. w 

d6, d7 

clr .w 

d6 

addx. w 

d6, d6 

lsr.w 

#4, dO 

lsr . w 

#4,dl 

sub.w 

dO, dl 

lsr.w 

#4, d2 

lsr.w 

#4, d3 

sub.w 

d2,d3 

bne 

setendmasks 


d7<- Dst Xmin 

d7<- Dst Xmin modl6 

d6<- Src Xmin modi 6 

d7<- Dst Xmin modl6-Src Xmin modl6 

if Sx&F > Dx&F then cy:l else cy:0 

d6<- initial skew flag table index 

d6[bit0]<- intraword alignment flag 

d0<- word offset to src Xmin 
dl<- word offset to src Xmax 
dl<~ Src span - 1 

d2<- word offset to dst Xmin 
d3<- word offset to dst Xmax 
d3<~ Dst span * 1 
2nd discriminator is one word dst 
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* When destination spans a single word, both end masks are merged ^ 

* into Endmaskl . The other end masks will be ignored by the BLiTTER 


and.w d5,d4 i d4<- single word end mask 

addq.w #4,d6 ; d6[bit2]:l -> single word dst 


set endmasks: 


move . w d4 , Endmaskl ( a5 ) * left end mask 

move . w #$FFFF,Endmask2(a5) ; center end mask 
move.w d5, Endmask3( a5) ; right end mask 


cmp.w dl,d3 ; the last discriminator is the 

bne set count ; equality of src and dst spans 


addq.w #2,d6 


d6[bitl] : 1 *> equal spans 


set count: 


move . w 
addq . w 
move . w 


d3,d4 

#l,d4 

d4 , XCount ( a5 ) 


; d4<- number of words in dst line 
; set value in BLiTTER 


* Calculate Source starting address: 

* 

* Source Form address + 

* (Source Ymin * Source Form Width) + 

* ((Source Xmin/16) * Source Xinc) 


move . 1 

SRC FORM(a4),aO 

7 

a0-> 

move . w 

SRC YMIN(a4),d4 

7 

d4<- 

move . w 

SRC NXLN(a4),d5 

• 

§ 

d5<- 

mulu 

d5,34 

• 

9 

d4<- 

add. 1 

d4, a0 

9 

a0-> 

move . w 

SRC NXWD(a4),d4 

• 

9 

d4<- 

move . w 

d4 , 5rc_Xinc ( a5 ) 

• 

9 


mulu 

d4,d0 

7 

d0<- 

add. 1 

d0,a0 

• 

9 

a0-> 


start of Src form 
offset in lines to Src Ymin 
length of Src form line 
byte offset to (0, Ymin) 

(0, Ymin) 

offset between consecutive 
words in Src plane 

offset to word containing Xmin 
1st src word (Xmin, Ymin) 


* Src Yinc is the offset in bytes from the last word of one Source 

* line to the first word of the next Source line 


mulu d4,dl 
sub.w dl,d5 
move.w d5, Src_Yinc( a5 ) 


dl<- width of src line in bytes 
d5<- value added to pointer at end 
of line to reach start of next 


* Calculate Destination starting address 


move . 1 DST_F0RM( a4 ) , al 
move.w DST_YMIN(a4),d4 
move.w DST NXLN(a4),d5 


; al-> start of dst form 
• d4<- offset in lines to dst Ymin 
; d5<- width of dst form 
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mulu 

d5,d4 

• 

9 

add.l 

d4,al 

• 

9 

move . w 

DST NXWD(a4),d4 

• 

9 

move . w 

d4 , &st_Xlnc ( a 5 ) 

• 

9 

mulu 

d4,d2 

• 

9 

add.l 

d2,al 

m 

9 

Calculate Destination Ylnc 


mulu 

d4,d3 


sub.w 

d3,d5 


move . w 

d5,Dst Yinc(a5) 



d4<- byte offset to (0, Ymin) 
al-> dst (0, Ymln) 

d4<- offset between consecutive 
words in dst plane 

d2<- DST NXWD * (DST_XMIN/16) 
al-> lst^dst word (Xmin, Ymin) 


d3<- width of dst line - DST NXWD 
d5<- value added to dst pointer at 
end of line to reach next line 


* The low nibble of the difference in Source and Destination alignment 

* is the skew value. Use the skew flag index to reference FXSR and NFSR 

* states in skew flag table. 


and.b 

#$0F, d7 

• 

9 

or .b 

skew flags (pc, d6.w) , d7 

• 

9 

move . b 

d7,Slcew(a5) 

« 

9 

move . b 

#mH0P Source, HOP (a5) 

« 

/ 

move . b 

#3,OPTa5) 

• 

9 

lea 

Line Num(a5),a2 

9 

move . b 

#f LineBusy, d2 

• 

9 

move . w 

PLANES ( a4 ) , d7 

9 

bra 

begin 



d7<- isolated skew count 
d7<- necessary flags and skew 
load Skew register 

set HOP to source only 
set OP to "replace" mode 

fast ref to Line_Num register 
fast ref to LineBusy flag 
d7 <- plane counter 
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* 

* 


ThE s E t T i N g Of SkEw FlAgS 


* QUALIFIERS ACTIONS 


* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 

* 


equal Sx&F> 

spans Dx&F FXSR NFSR 

0 0 0 1 

0 1 1 0 | 

10 0 0 
1 1 111 


BITBLT DIRECTION: LEFT -> RIGHT 


. . SSSSSSSSSSSSSS I 8SS8SSSSSS8SSS . . I 

dddddddddd | dddddddddddddddd | dd 

8S8S88888S I 8888888888888888 I 88 

. . dddddddddddddd | dddddddddddddd. . | 

. .88888888888888 I S88S888SS8SSSS. . I 
. . . ddddddddddddd | ddddddddddddddd . | 

. . .88888888888881888888888888888. I 
. . dddddddddddddd | dddddddddddddd . . | 


! 

I 


skew_f lags: 


dc.b 

dc.b 

dc.b 

dc.b 


mSkewNFSR 

mSkewFXSR ; 

0 

mSkewNFSR+mSkewFXSR 


Source span < Destination span 
Source span > Destination span 
Spans equal Shift Source right 
; Spans equal Shift Source left 


* When Destination span is but a single word . . . 


dc.b 

dc.b 

dc.b 

dc.b 


0 

mSkewFXSR 

0 

0 


; Implies a Source span of no words 
; Source span of two words 
; Skew flags aren’t set if Source and 
; Destination spans are both one word 
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nextplane : 

move . 1 aO,Src_Addr(a5) ; load Source pointer to this plane 

move.l al,Dst Addr(a5) ; load Destination ptr to this plane 

move . w HEIGHTTa4),Y_Count(a5) ; load the line count 


move.b #mLineBusy, (a2 ) ; <<< start the BLiTTER >>> 


add.w SRC_NXPL(a4),aO ; a0-> start of next src plane 

add.w DST_NXPL(a4),al ; al-> start of next dst plane 


* The BLiTTER is usually operated with the HOG flag cleared. 

* In this mode the BLiTTER and the ST's cpu share the bus equally, 

* each taking 64 bus cycles while the other is halted. This mode 

* allows interrupts to be fielded by the cpu while an extensive 

* BitBlt is being processed by the BLiTTER. There is a drawback in 

* that BitBlts in this shared mode may take twice as long as BitBlts 

* executed in hog mode. Ninety percent of hog mode performance is 

* achieved while retaining robust interrupt handling via a method 

* of prematurely restarting the BLiTTER. When control is returned 

* to the cpu by the BLiTTER, the cpu immediately resets the BUSY 

* flag, restarting the BLiTTER after just 7 bus cycles rather than 

* after the usual 64 cycles. Interrupts pending will be serviced 

* before the restart code regains control. If the BUSY flag is 

* reset when the Y_Count is zero, the flag will remain clear 

* indicating BLiTTER completion and the BLiTTER won't be restarted. 

* 

* ( Interrupt service routines may explicitly halt the BLiTTER 

* during execution time critical sections by clearing the BUSY flag. 

* The original BUSY flag state must be restored however, before 

* termination of the interrupt service routine. ) 


restart: 


bset.b 

d2, (a2) 

nop 


bne 

restart 

* 


begin: dbra 

d7 , next_plane 

rts 



; Restart BLiTTER and test the BUSY 
; flag state. The "nop" is executed 
; prior to the BLiTTER restarting. 

; Quit if the BUSY flag was clear. 
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Appendix B — Extended ST BIOS Blit Configuration 


0x40 Blitmode - Get/Set Blit Configuration 

Synopsis: WORD Blitmode ( flag ) 

WORD flag; 


Extended ST BIOS (trap #14) function number 0x40 (64 

decimal) gets and sets the blit configuration. If FLAG is 
-1 (Oxffff), then no set operation is performed, and the 
current blit configuration is returned. If FLAG is not -1, 
then the blit configuration is set as follows: 

bit 0: 0: set blit mode to soft (use software) 

1: set blit mode to hard (use BLITTER) 

bits 1..14: undefined, reserved 

bit 15: must be zero 


The previous blit configuration is returned in the low 
word of DO. The fields are: 


bit 

0: 

0: blits are being done in software 
1: blits are being done in hardware 

bit 

1: 

0: no BLiTTER is available 

1: a BLiTTER is installed in the system 

bits 2. .14: 

undefined, reserved, may be zero or one 
on return. 

bit 

15: 

always returned as zero 


If an attempt is made to set the blit mode to "hard" on 
a system that does not contain a BLITTER, the mode is forced 
to "soft". 

The reserved fields are for future blit capabilities 
and other graphics chips. They should be treated as "don't 
care" fields and should be maintained (intact) because they 
will acquire meaning in the future. 

This call works on all ROM versions of the operating 
system. 
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EXAMPLE CALL USING C 


# define Blitmode(a) xbios(64,a) 


curmode «= Blitmode ( -1 ) ; 
Blitmode ( curmode | 1 ) ; 
do stuff ( ) ; 

BlXtmode ( curmode ) ; 


/* get current mode */ 

/* turn on BLiTTER */ 

/* ... do some processing */ 
/* restore blit state */ 


EXAMPLE CALL USING 68000 ASSEMBLY 


* 

* 

* 


move.w #-l,-(sp) 
move . w #$40, -(sp) 
trap #14 

addg #4 , sp 

move . w dO , - ( sp ) 
or.w #l,d0 

move . w dO , - ( sp ) 
move.w #$40, -(sp) 
trap #14 

addq #4,sp 

... do some processing 

move.w #$40, -(sp) 
trap #14 

addq #4 , sp 


DO = Blitmode(-l) 


save old blit state 
make sure it's on 
Blitmode (DO) 


restore old blit mode 
from stacked old-state 
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